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EXT-036 

METHODS FOR DISEASE DETECTION 
Background of the Invention 

Many diseases are associated with genomic instability. That is, a disruption 
in genomic stability, such as a mutation, has been linked to the onset or progression 

5 of disease. Accordingly, various aspects of genomic instability have been proposed 
as reliable markers for disease. For example, mutations in the BRCA genes have 
been proposed as markers for breast cancer, and mutations in the p53 cell cycle 
regulator gene have been associated with numerous cancers, especially colorectal 
cancer. It has been suggested that specific mutations might be a basis for molecular 

10 screening assays for the early stages of certain types of cancer. See, e.g., 
Sidransky, etal., Science, 256: 102-105 (1992). 

The search for genomic disease markers has been especially intense in the 
area of cancer detection. Cancer is characterized by uncontrolled cell growth which 
can be associated with one or more genetic mutations. Such mutations can cause 

15 the affected cells to avoid cell death. For example, a mutation in a tumor suppressor 
gene can cause cells to avoid apoptosis - a type of cell death thought to be under 
direct genetic control. During apoptosis, cells lose their membranes, the cytoplasm 
condenses, and nuclear chromatin is split into oligonucleotide fragments of 
characteristically short length. In fact, those characteristic DNA cleavage patterns 

20 have been proposed as an assay for apoptosis. 

Attempts have been made to identify and use nucleic acid markers that are 
indicative of cancer. However, even when such markers are found, using them to 
screen patient samples, especially heterogeneous samples, has proven 
unsuccessful either due to an inability to obtain sufficient sample material, or due to 

25 the low sensitivity that results from measuring only a single marker. Simply 
obtaining an adequate amount of human DNA from one type of heterogeneous 
sample, stool, has proven difficult. See Villa, et al., Gastroenterol., 110: 1346-1353 
(1996) (reporting that only 44.7% of all stool specimens, and only 32.6% of stools 
from healthy individuals produced sufficient DNA for mutation analysis). Other 

30 reports in which adequate DNA has been obtained have reported low sensitivity in 
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identifying a patient's disease status based upon a single cancer-associated 
mutation. See Eguchi, et al., Cancer, 77: 1707-1710 (1996) (using a p53 mutation 
as a marker for cancer). 

Investigators have attempted to analyze mutations in DNA of tumor cells shed 
5 into luminal areas, such as the colon, bile ducts, blood vessels and the like. Such 
attempts have only been successful when there is a known mutation and a relatively 
high concentration of cellular material has been found. See e.g., Mulcahy, et al., 
Ann. Oncol. 10 Suppl 4:1 14-1 17 (1999). No attempts have been made to correlate 
disease status with DNA integrity in shed cellular material. 

10 

Summary of the Invention 

The present invention provides that the integrity of nucleic acids, proteins, 
and/or other cellular components in a biological sample comprising shed cellular 
material indicates the disease status of the patient from whom the sample was 

15 obtained. According to the invention, tissue or body fluid samples, especially those 
described below, contain shed cellular debris. In healthy patients, such debris is the 
result of apoptotic degradation as part of the normal cell cycle. Apoptosis reduces 
the integrity (intactness) of nucleic acids, proteins, and other cellular components in 
healthy individuals, so that only small fragments exist in the debris that results from 

20 the apoptotic process (e.g., exfoliated cellular debris). To the contrary, in diseases 
such as cancer in which cell cycle mechanisms are destroyed or impaired, cellular 
debris comprises high-integrity cellular components, such as nucleic acids (i.e., 
nucleic acids that have not been degraded by apoptosis ). 

Methods of the invention comprise using the integrity of cellular components 

25 as a measure of patient disease status. Integrity is measured as any indication of 
the presence of intact cellular components (e.g., length, molecular weight, 
secondary, tertiary, quaternary structure, etc.) According to methods of the 
invention, a tissue or body fluid specimen containing sloughed cellular debris 
obtained from a patient having a disease contains an amount of intact cellular 

30 components (e.g., nucleic acid) that is greater than would be expected in such a 
specimen obtained from a healthy patient. Thus, a measure of intact nucleic acid, 
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proteins, or other cellular components in a patient sample is indicative of the overall 
disease status of the patient. The invention is equally applicable to human and to 
veterinary uses. Accordingly, "patient" as defined herein means humans or other 
animals. 

5 A healthy patient generally produces cellular debris through normal apoptotic 

degradation, resulting in relatively small fragments of cellular components in tissue 
and body fluid samples, especially luminal samples. Patients having a disease 
generally produce cells and cellular debris, a proportion of which has avoided normal 
cell cycle regulation, resulting in relatively large cellular components. Without being 

10 held to theory, the present invention takes advantage of this and other insights 
concerning the ways in which cells respond to diseases, especially diseases 
associated with genetic abnormalities (either induced or inherited). As a result, it 
has been discovered that the disease status of a patient is determined by analysis of 
patient cellular components produced in specimens obtained from the patient. Most 

15 preferably, such specimens are those most likely to contain sloughed cellular debris. 
Such specimens include, but are not limited to, stool, blood serum or plasma, 
sputum, pus, colostrum, and others. In diseases, such as cancer, in which genomic 
instabilities or abnormalities have interfered with normal cell cycle regulation, 
specimens such as those identified above contain relatively intact fragments of 

20 cellular components. The presence of such fragments is a general diagnostic 
screen for disease. Any cellular component(s) may be used in methods described 
herein, including, but not limited to, nucleic acids, proteins, carbohydrates, sugars, 
membranes, lipids, receptors, and the like. 

Accordingly, methods of the invention comprise screening a patient for 

25 disease by analysis of the integrity of cellular components in a tissue or body fluid 
specimen obtained from the patient. Preferred specimens include those comprising 
shed cells or cellular debris. Such preferred specimens comprise stool, sputum, 
urine, bile, pancreatic juice, and blood serum or plasma, all of which contain shed 
cells or cellular debris. Methods of the invention are useful as general disease 

30 screens, and are especially useful as screens for cancer. Cancer is an example of a 
disease thought to be associated with genomic instabilities, and specifically with the 
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loss of control over the normal cell cycle. Thus, tumor cells are typically intact and 
routinely are shed into, for example, stool, sputum, urine, bile, pancreatic juice, and 
blood. Such shed cells and cellular debris contain higher integrity nucleic acids and 
other cellular components compared to those found in specimens obtained from a 
5 healthy patient. There are numerous ways in which the integrity of cellular 
components in a patient specimen are measured as a screen for disease. 

In a preferred embodiment, disruption of the integrity of cellular components 
is measured in a stool sample. According to the invention, shed cells and cellular 
debris indicative of disease are shed by various tissues/organs, and eventually 

10 appear in the forming stool. Accordingly, an analysis of stool provides a diagnostic 
screen for disease generally as shown in Example 5 below. Other samples, such as 
blood, urine, sputum, etc. also provide a view of the overall health status of the 
patient. Follow-up testing is used to further diagnose and treat the disease. 

In a preferred embodiment, the cellular component used as a measure of 

15 disease is a nucleic acid. Nucleic acid integrity preferably is measured by the ability 
to amplify nucleic acids in a sample. However, integrity is also measured by 
sequencing, mass spectrometry, X-ray, etc. A preferred method comprises 
conducting in a tissue or body fluid sample an amplification reaction using as a 
template a nucleic acid locus suspected to be in the sample. If the amount of 

20 amplification product (amplicon) is greater than the amount of amplicon expected to 
be present in a normal sample (e.g., one not having the disease being screened), 
the sample is determined to be positive. In some cases, the presence of any 
amplification product is sufficient to justify a positive screen for disease. It is 
preferable that, in the case of DNA, the amplification reaction is a polymerase chain 

25 reaction (PCR) or, in the case of RNA, that the amplification reaction is reverse 
transcriptase PCR. Primers are designed to amplify the locus or loci chosen for 
analysis. For purposes of the invention a "genomic locus" is any genetic element, 
including but not limited to a coding region of a gene, a non-coding nucleic acid 
region, a regulatory element of a gene, an intron or RNA. It is not required that the 

30 target genomic loci be associated with any specific disease, as an increase in 
amplifiable nucleic acid is itself diagnostic. 
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In one preferred embodiment, the presence of a single high molecular weight 
amplicon is a positive screen. Preferably, a fragment of about 1 .3 Kb or greater is 
measured as an indicator of high integrity nucleic acids in the patient sample. 

In a highly-preferred embodiment, a profile of amplification products across a 
5 range of nucleic acid fragments of different lengths is produced. In a preferred 
embodiment, a series of amplification reactions is conducted at a single genomic 
locus, each reaction being designed to amplify a fragment of unique length. If 
detectable amplicon is produced in each reaction, or in a number of reactions 
greater than expected in a sample obtained from a healthy patient, the sample is 
10 determined to be positive. For example, attempts are made to amplify fragments of 
200 bp, 400 bp, 800 bp, 1.3 Kb, 1.8 Kb, and 2.4 Kb at the same genomic locus. In a 
sample obtained from a healthy individual (a "normal" sample), it would be expected 
that little or no amplification product is observed, especially when the longer portions 
of the locus are used as the template. To the contrary, at least some proportion of 
15 cells and cellular debris in a sample obtained from a diseased patient will contain 
intact fragments. 

In another embodiment, a profile of amplification products across a range of 
nucleic acid fragments of different lengths is produced by a series of amplification 
reactions conducted on a series of different genomic loci, each reaction being 

20 designed to amplify a fragment of unique length. If detectable amplicon is produced 
in each reaction, or in a number of reactions greater than expected in a sample 
obtained from a patient not having the disease being screened, the sample is 
determined to be positive. 

According to methods of the invention, normal samples do not produce 

25 significant amounts of detectable amplicon at any length significantly greater than 
the typical apoptotic fragment (about 175 bp). Accordingly, whether primers are 
spaced to amplify fragments of only one length at a given genomic locus, or whether 
a series of amplifications at the locus are conducted, differences are readily 
observable between normal and diseased samples. 

30 As detailed below, methods of the invention are useful to detect disease in 

biological samples comprising shed cells or cellular debris. For example, the 
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presence in a patient stool sample of amounts of nucleic acid, preferably DNA, 
above a predetermined threshold for healthy patients is indicative that the patient 
has a disease. Follow-up analysis is used to determine where the disease resides. 
However, the general disease screen is effective independent of the locus of the 

5 disease and the specimen taken for analysis. Thus, while the analysis of nucleic 
acids in stool is predictive of disease generally, it does not necessarily indicate that 
the disease is of gastrointestinal origin. However, follow-up screening based, for 
example, on mutational analysis, is adequate to identify the locus of disease. 
Numerous mutational analyses are known in the art and include, for example, U.S. 

10 Patent No. 5,670,325, incorporated by reference herein. Moreover, the intensity of, 
for example, bands on nucleic acid gels may be indicative of the locus of disease. 

Also in a preferred embodiment, methods of the invention are used to monitor 
the progress of a disease in a patient or in populations of patients. Such longitudinal 
monitoring provides information on the degree to which integrity of cellular 

15 components is increasing or decreasing as disease progresses or recedes. Such 
longitudinal monitoring can be used to assess the efficacy of treatments (e.g, 
chemotherapy, antibiotics), and the response of patients to therapeutic interventions. 
Methods of the invention can also be used to predict disease flare-up. For example, 
monitoring fluctuations in patient nucleic acid integrity in diseases, such as 

20 inflammatory bowel disease, is useful to predict onset of disease episodes. 

According to the invention, episodic occurrence of symptoms is tied to an increase in 
high-integrity cellular components, such as nucleic acids. 

Methods of the invention are also useful to establish patient databases. Such 
databases are used to identify specific patients, to establish where a particular 

25 patient fits in a disease continuum (based upon the amount of high integrity cellular 
components present in the patient sample as compared to the database in order to 
assist in diagnosis), to follow trends in disease, to predict disease onset, or to 
compile statistics on disease frequency, to monitor patient progress and treatment 
efficacy, and the like. 

30 Methods of the invention are also useful to predict risk for disease and to 

predict disease onset. Levels of cellular component integrity are useful as a 
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quantitative or quasi-quantitative measure of disease. Thus, the level of, for 
example, high integrity nucleic acid obtained from a patient sample is compared to 
standards representing various stages of disease in order to assess the patient's 
disease state and prognosis. 
5 Methods of the invention are also useful to monitor viral and bacterial load. 

Target bacterial or viral nucleic acids or proteins are isolated and analyzed using the 
molecular weight profiles described herein in order to characterize the state of 
disease of the patient sample. Thus, methods of the invention are useful to screen 
patients for HIV, and to monitor its progress. 

10 Methods of the invention are also useful to screen for stroke, heart attack, 

asthma, and arthritis. Those conditions each result in the shedding of cells and 
cellular debris comprising intact or high-integrity cellular components produced by 
means other than apoptosis. 

In an alternative embodiment, screening of patient samples by detecting 

15 amounts of nucleic acid in the sample is combined with an assay for apoptotic cell 
activity. Such assays may be combined with detecting amounts of nucleic acid in a 
patient sample as a screen for disease status. A positive screen is one that 
produces both: (1) an amount of nucleic acid that is greater than the amount 
expected to be present in a normal sample (e.g., one not having the disease being 

20 screened), and (2) an amount of apoptotic cell activity that is less than that expected 
to be present in a normal sample. In a highly preferred embodiment, methods of the 
invention comprise analyzing a plurality of genomic loci to determine an amount of 
amplifiable nucleic acid present at each locus. Analysis across multiple loci using 
methods of the invention may increase the sensitivity of the screening assay. 

25 As will be exemplified in detail below, methods of the invention comprise 

screening a biological sample for an abnormality in a nucleic acid by conducting an 
amplification reaction using as a template a nucleic acid suspected or expected to 
be in the sample; determining an amount of amplification product obtained; 
comparing the amount of amplicon obtained to a standard amount of amplification 

30 product; and identifying a sample as having an abnormality in a nucleic acid if the 
amount of amplification product differs from the standard amount. In a preferred 
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embodiment, a standard amount of amplification product is determined by 
amplification of a locus, or portion thereof, being screened (e.g., an intact, wild-type 
nucleic acid) in a known normal sample (one obtained from an individual known not 
to have the disease being screened). Also in preferred embodiments, a standard 
5 amount is determined by reference to the art. In certain embodiments of the 

invention, the standard amount is essentially no detectable amplicon due to the lack 
of high-integrity nucleic acids in the sample. Accordingly, any detectable amplicon in 
a patient sample is indicative of a positive screen. That is the case especially when 
a large (e.g., 1.8 Kb or 2.4 Kb) fragment is being screened. Finally, the standard 

10 amount can be a molecular weight marker on, for example, an electrophoretic gel. 

In a preferred embodiment of the invention, the sample is prepared from a 
specimen selected from the group consisting of stool, sputum, blood, urine, 
cerebrospinal fluid, seminal fluid, saliva, breast nipple aspirate, and biopsy tissue. 
However, any tissue or body fluid specimen may be used according to methods of 

15 the invention. Especially preferred are samples of luminal fluid because such 
samples are generally free of intact, healthy cells. Such samples include blood, 
urine, bile, pancreatic juice, stool, sputum, pus, and the like. 

Also in a preferred embodiment, the nucleic acid or nucleic acids being 
interrogated is (are) DNA. In a more particular embodiment, the nucleic acid being 

20 analyzed is selected from a coding region of a gene, or portion thereof, a noncoding 
nucleic acid region, or portion thereof, a regulatory element of a gene or a portion 
thereof, and an unidentified fragment of genomic DNA. Also in a preferred 
embodiment, the nucleic acid being interrogated is RNA. As is appreciated by the 
skilled artisan, any genomic locus is amenable to screening according to the 

25 invention. The particular locus or loci chosen for analysis depends, in part, on the 
disease being screened, and the convenience of the investigator. It is not necessary 
that the locus or loci chosen for analysis be correlated with any specific disease 
because methods of the invention contemplate measuring either the total nucleic 
acid in a sample or amplifiable nucleic acid in a sample as an indicator of overall 

30 disease status or the presence and/or extent of apoptosis in the sample. However, 
disease-associated loci (those in which a mutation is indicative, causative, or 



-8- 



otherwise evidence of a disease) can be used. Preferred disease-associated loci 
include p53, ape, MSH-2, dec, scr, c-myc, B-catnenin, mlh-1, pms-1, pms-2, pol- 
delta, and bax. 

The amount of amplification product may be determined by any suitable or 
5 convenient means. Preferably, the amount of amplification product is determined by 
gel electrophoresis. Labels, such as fluorescent or radioactive labels, may be used. 

The amounts of amplification product produced may be compared to standard 
amounts by any suitable or convenient means, including, but not limited to visual 
comparison, machine-driven optical comparison, densitometry, mass spectroscopy, 

10 hybrid capture, and other known means. The amplification reaction itself can be any 
means for amplifying nucleic acid, including, but not limited to PCR, RT-PCR, OLA, 
rolling circle, single base extension, and others known in the art. The amplification 
product can also be measured by signal amplification techniques, such as branch 
chain amplification (Chiron). Methods of the invention are useful with any platform 

15 for the identification, amplification, sequencing, or other manipulation of nucleic 
acids. For example, methods of the invention can be applied to ligase chain 
reaction, strand displacement (Becton-Dickinson), and others. 

Also in a preferred embodiment of the invention, a series of amplification 
reactions is conducted on a single genomic locus. Each amplification reaction in the 

20 series is designed to amplify a fragment of a different length. In a preferred 

embodiment, the target fragment lengths are 200 bp, 400 bp, 800 bp, 1.3 Kb, 1.8 Kb, 
and 2.4 Kb. Primers for amplification are designed according to knowledge in the art 
in order to amplify template, if present, of the desired length at the desired locus. A 
positive screen is one that produces amplicon in at least one, and preferably at least 

25 two of the series of amplification reactions. As noted above, a normal sample which 
has undergone or which is undergoing apoptosis typically contains little or no 
fragments of significant length. Thus, a series of amplification reactions targeting 
fragments from about 200 bp to about 2.4 Kb and longer reveals samples that 
contain nucleic acids that have avoided apoptosis as evidenced by the amplification 

30 of large fragments. 
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Preferred methods of the invention also comprise conducting amplification 
reactions on a series of different genomic loci. Preferably, from about 2 to about 7 
loci are used. However, the precise number of interrogated loci is determined by the 
individual investigator based upon the disease to be detected or based upon 
5 convenience. According to methods of the invention, primers are designed to 
amplify nucleic acid (preferably DNA) at each of the chosen loci. A sample in which 
at least one locus, preferably at least two loci, and most preferably at least three loci 
produce detectable amplification product is considered a positive sample. The 
lengths of fragments to be amplified in this assay may be varied, but are preferably 

10 at least about 180 bp each in length. It is not necessary that the same length 
fragments be amplified at each of the chosen loci. 

Methods of the invention also comprise conducting a series of amplification 
reactions at a series of different genomic loci. Each amplification reaction in the 
series is designed to amplify a fragment of a different length. Preferably, from about 

15 2 to about 7 amplification reactions on about 2 to about 7 loci are used. However, 
the precise number of interrogated loci is determined by the individual investigator 
based upon the disease to be detected or based upon convenience. In a preferred 
embodiment, the target fragment lengths are 200 bp, 400 bp, 800 bp, 1 .3 Kb, 1 .8 Kb, 
and 2.4 Kb. Primers for amplification are designed according to knowledge in the art 

20 in order to amplify template if present. It is preferred, but not necessary, that the 
same length fragments be amplified at each of the chosen loci. A positive screen is 
one that produces amplicon in at least one, and preferably at least two of the series 
of amplification reactions and in which at least one locus, preferably at least two loci, 
and most preferably at least three loci produce detectable amplification product. As 

25 noted above, a normal sample which has undergone or which is undergoing 

apoptosis typically contains little or no fragments of significant length. Thus, a series 
of amplification reactions targeting fragments from about 200 bp to about 2.4 Kb and 
longer reveals samples that contain nucleic acids that have avoided apoptosis as 
evidenced by the amplification of large fragments. 

30 Methods of the invention may also be used to assess the integrity of cellular 

components in a biological sample. For example, using DNA as the component to 
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be measured, such methods comprise conducting an amplification reaction using at 
least two loci suspected to be in the sample as templates; determining which loci 
produce detectable amplicon; and assessing the integrity of DNA in the sample as a 
function of the number of loci producing amplicon. The integrity of DNA in the 
5 sample is high when amplicon is produced in one or more of the amplification 
reactions. This method is especially useful for determining whether a 
heterogeneous sample has sufficient nucleic acid for measurement. Accordingly, 
such methods are used to screen or to "qualify" samples for further analysis (e.g., 
genetic, biochemical, cytological, or other analyses). 

10 Methods of the invention may also be used to assess fetal abnormalities by 

conducting amplification reactions on nucleic acids in maternal blood. Just as 
described above, the ability to amplify significant amounts of nucleic acid is an 
indicator of a genomic instability. A baseline for comparison of the extent of nucleic 
acid amplification can be amounts of nucleic acids from known normal samples. 

15 The amount of amplification obtained from fetal samples is placed on a continuum, 
and the investigator must analyze any given sample in terms of the amount of fetal 
nucleic acid produced in various disease states and in normal samples. 

Methods of the invention are useful as diagnostic screening methods. Often it 
is desirable to perform follow-up testing on a patient in order to confirm a suspected 

20 disease state. Such follow-up procedures are determined based upon the disease 
state being interrogated. For example, a colonoscopy may be suggested in a case 
in which a stool sample is positively screened according to methods of the invention. 
Appropriate follow-up with CT scan, PET scan, x-ray, ultrasound, ERCP, 
Endoscopy, MRI, virtual colonoscopy, biopsy, or other measures may be appropriate 

25 depending upon the diagnosis. 

Methods of the invention are useful as screens for a wide range of disease 
states. In addition to colon cancers and adenomas, methods of the invention are 
useful to screen for other diseases, for example, as screens for lymphomas, or 
stomach, lung, liver, pancreas, prostate, kidney, testicular, bladder, uterus, or 

30 ovarian cancers or adenomas. In addition to cancer, methods of the invention are 
useful, for example, as screens for diseases such as inflammatory bowel syndrome, 
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inflammatory bowel disease, Crohn's disease, and others in which a genomic 
instability is thought to play a role. Methods of the invention are especially useful as 
screens for any disease that impairs the proper function of the gastrointestinal 
system; most especially diseases of the colon. Methods of the invention are also 
5 useful to screen for apoptosis in a cellular sample. The profile of amplifiable DNA in 
a sample is correlated with proteins that have been associated with disease. For 
example up regulation of the apoptosis protein, survivin, is correlated with increased 
amounts of amplifiable DNA, as is the Ras oncogene, as well as other oncogenes 
and their gene products. 

10 Methods of the invention are also useful as assays for apoptosis. The 

presence of high-integrity fragments or large quantities of nucleic acids in a sample 
indicates that the sample was derived from cells that did not proceed through 
apoptosis. The absence of such fragments or quantities indicates that cells that 
contributed to the sample did undergo apoptosis. Accordingly, an apoptotic activity 

15 assay of the invention, either alone or in combination with other assays for genomic 
instability, are useful as screens for disease. 

Finally, methods of the invention can be carried out by hybrid capture. For 
example, hybrid capture and subsequent analysis of the captured fragments can be 
used to determine the nucleic acid integrity of a sample. 

20 The invention also provides a profile of nucleic acid fragments indicative of 

disease. A preferred profile is obtained through methods described above. 
Preferred profiles comprise nucleic acids having between about 200 bp and about 
2.4 Kb obtained in a patient sample comprising cellular debris according to methods 
described herein. A highly preferred profile contains at least one nucleic acid of at 

25 least 1.3 Kb. 

Other objects and advantages of the invention are apparent upon 
consideration of the following drawings and detailed description thereof. 
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Description of the Drawings 



Figure 1 is a gel photograph showing results of amplification of K-ras (exon 1) 
DNA isolated from stool using forward and reverse primers spaced about 200 bp 
5 apart. The band intensity relates to the amount of 200 bp product or greater in the 
sample. Lanes 1-4 are results from patients with cancer or adenoma, lane 5 is a 
positive control, lanes 6-10 are from patients who did not have cancer or adenoma, 
lanes 11-12 are negative controls, and lanes 13-18 are standards at the approximate 
molecular weight indicated in the figure. Amplifications were graded A through C, A 

10 being the most intense band, C being the least. 

Figures 2-4 are gel photographs showing results of amplification of ape (exon 
15) DNA isolated from stool using forward and reverse primers spaced about 200 bp 
apart. The band intensity relates to the amount of 200 bp product or greater in the 
sample. Lanes 1-4 are results from patients with cancer or adenoma, lane 5 is a 

15 positive control, lanes 6-10 are from patients who did not have cancer or adenoma, 
lanes 11-12 are negative controls, and lanes 13-18 are standards at the approximate 
molecular weight indicated in the figure. Amplifications were graded A through C, A 
being the most intense band, C being the least. 

Figure 5 is a gel photograph showing results of amplification of p53 (exon 5) 

20 DNA isolated from stool using forward and reverse primers spaced about 200 bp 
apart. The band intensity relates to the amount of 200 bp product or greater in the 
sample. Lanes 1-4 are results from patients with cancer or adenoma, lane 5 is a 
positive control, lanes 6-10 are from patients who did not have cancer or adenoma, 
lanes 11-12 are negative controls, and lanes 13-18 are standards at the approximate 

25 molecular weight indicated in the figure. Amplifications were graded A through C, A 
being the most intense band, C being the least. 

Figure 6 is a gel photograph showing results of amplification of p53 (exon 7) 
DNA isolated from stool using forward and reverse primers spaced about 200 bp 
apart. The band intensity relates to the amount of 200 bp product or greater in the 

30 sample. Lanes 1-4 are results from patients with cancer or adenoma, lane 5 is a 
positive control, lanes 6-10 are from patients who did not have cancer or adenoma, 
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lanes 11-12 are negative controls, and lanes 13-18 are standards at the approximate 
molecular weight indicated in the figure. Amplifications were graded A through C, A 
being the most intense band, C being the least. 

Figure 7 is a gel photograph showing results of amplification of p53 (exon 8) 
5 DNA isolated from stool using forward and reverse primers spaced about 200 bp 
apart. The band intensity relates to the amount of 200 bp product or greater in the 
sample. Lanes 1-4 are results from patients with cancer or adenoma, lane 5 is a 
positive control, lanes 6-10 are from patients who did not have cancer or adenoma, 
lanes 11-12 are negative controls, and lanes 13-18 are standards at the approximate 

10 molecular weight indicated in the figure. Amplifications were graded A through C, A 
being the most intense band, C being the least. 

Figure 9-10 are gel photographs of results of amplification of DNA from stool 
samples using forward and reverse primers spaced approximately 1 .8 Kb apart. The 
band intensity shows the amount of 1.8 Kb or greater product. Lanes 1, 8, and 9 are 

15 negative controls, lanes 2, 3, and 5 are results from patients with cancer or 

adenoma, lanes 4, 6, and 7 are results from patients who did not have cancer or 
adenoma, and lanes 10-14 are molecular weight standards. 

Figures 1 1 A and B are gel photographs of results of amplification of DNA in 
stool from a total of 30 patients and controls. The band intensity relates to the 

20 amount of amplifiable DNA in the sample. Lanes N are negative controls, lanes 1 , 3, 
11, and 18 are results from patients which are indicative of the presence off cancer 
or adenoma, lanes 2, 4, 5-10, 12-17, and 19-30 are results from patients which are 
indicative of the absence of cancer or adenoma. The remaining lanes are markers 
or standards. 

25 Figure 12 shows a schematic representation of the placement of the primers 

for amplification in a method of the present invention. In this method, a single 
forward primer, Fi, is used in conjunction with a series of reverse primers, Ri to R 6 , 
chosen to amplify progressively longer portions of the target. 

Figure 13 shows a schematic representation of the placement of the primers 

30 for amplification in a method of the present invention. In this method, a series of 
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forward and reverse primer pairs, (Fi , Ri) to (F 3 , R 3 ), are chosen to amplify portions 
of the target spaced at intervals along the target. 

Detailed Description of the invention 

5 The invention provides methods for the analysis of biological samples. 

Methods of the invention provide diagnostically-relevant information based upon the 
integrity of nucleic acids in a biological sample. Normal biological samples (those 
not having indicia of the disease being screened), especially those comprising 
luminal tissue and/or fluid, typically comprise a majority of short-fragment, low- 

10 integrity nucleic acids (especially DNA) which are the result of degradation by 
apoptosis. When a mutation has caused genomic instability, the normal cell cycle 
may be disrupted and apoptotic degradation may not occur at the rate expected in a 
normal sample. Methods of the invention screen for such disruptions. 

Accordingly, preferred methods of the invention comprise determining an 

15 amount of amplifiable nucleic acid in a biological sample, and determining whether 
that amount is consistent with an amount expected in a normal sample. In many 
biological samples, especially heterogeneous samples, there may be no detectable 
amplification product. That is especially true when longer fragments are used as 
templates for amplification. Generally, the probability that any given set of PCR 

20 primers will amplify a DNA fragment having a length exceeding the primer distance 
is expressed as 

% of Fragments Amplified = (FL-PD)/(FL+PD) 

25 wherein FL is fragment length (in base pairs) and PD is primer distance (in 

base pairs). This equation assumes that sample DNA fragment lengths are 
uniformly distributed (i.e., there is no favored locus at which breaks occur). 

In a preferred embodiment, methods of the invention comprise amplifying 
sequences of different length in a sample, if present, in order to generate a profile of 

30 amplification products indicative of disease or the propensity for disease. In a 

preferred method, a sample is exposed to a set of PCR primers comprising a single 
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forward primer, which may be a capture probe used to capture target fragments, and 
a plurality of downstream reverse primers which hybridize to portions of a contiguous 
sequence (if present) in the sample. Amplifications using these primers will result in 
a series of amplification products, each having a different length, if the contiguous 
5 target sequence is present in the sample. The length of the amplification products 
are determined by the spacings between the forward primer and each of the 
downstream reverse primers. An example is shown in Figure 12, which is a 
schematic representation showing placement of the primers for amplification. 

If the target sequence, or a portion of it, is present in the sample, amplification 

10 will result in a series of fragments the length of which is dictated by the spacing of 
the primers. According to the principles adduced above, a sample from a diseased 
patient will produce a profile of amplification products in the assay described above 
that differs from the profile obtained from a sample containing the smaller fragments 
expected to be produced as a result of normal apoptosis. In a preferred 

15 embodiment, the forward primer is designed to hybridize about 200 bp upstream of 
the first reverse primer, and about 2.3 Kb upstream of the last reverse primer. Other 
reverse primers are designed to hybridize at various locations between the first and 
last reverse primers. Preferred intervals between the forward primer and the various 
reverse primers are 200 bp (F^RO, 400 bp (Fi-R 2 ), 800 bp (F^Rs), 1.3 Kb, (F1-R4), 

20 1 .8 Kb (F1-R5), and 2.3 Kb (F r R 6 ). The number and spacing of reverse primers is 
chosen at the convenience of the skilled artisan. 

Also in a preferred embodiment, a hybrid capture probe is used to anchor a 
target sequence, preferably on a solid support (e.g., beads). A plurality of probes 
are then placed at various distances downstream of the capture probe. Those 

25 probes can be pairs of forward and reverse primers as discussed above, or they can 
be signal amplification probes, such as those used in Ligase Chain Reaction (LCR), 
and others used in the identification of sequences. The plurality of probes hybridize 
along the length of a target fragment if the target is present in the sample. Thus, by 
interrogating samples for the presence of the probes, one can determine the integrity 

30 of sequences present in the sample. This can be done in numerous ways, including, 
but not limited to, hybrid capture, PCR, LCR, strand displacement, branched chain, 
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or other assays known in the are that incorporate hybrid probes or primers in order 
to identify or quantitate sequence. A sample containing intact (high integrity) nucleic 
acids represents a positive screen according to the invention. In one embodiment, 
sample is placed into wells (e.g., on a 96 well plate) containing support-bound 
5 capture probe. The capture probe immobilizes a target sequence, if present in the 
sample. Probes that hybridize to sequence downstream of the capture probe 
(downstream probes) are placed into each well, such that each downstream probe is 
spaced a unique distance apart from the common capture probe, and each well 
contains only one type of downstream probe. Signal is then generated by, for 

10 example, amplification, or by standard ELISA procedure followed by amplification, or 
by LCR, or other methods mentioned above. The presence of signal in each well 
indicates the presence of sequence of at least the length between the capture probe 
and the downstream probe. In an alternative embodiment, each well receives 
multiple different downstream probes, which may be distinctly labeled, and the 

15 presence of label(s) is correlated with the length of sequence presence in the 
sample. 

A sample from a patient having, for example, cancer will produce amplicon 
between most or all of the primer pairs (depending, inter alia, on the length of the 
target fragments, on the spacing of the primers, and where on the target the primers 

20 are spaced). Such a profile represents a positive screen for disease or the 
propensity for disease. A sample from a patient who does not have indicia of 
disease results in little or no amplification product in the assay described above. In 
a negative screen there may be amplification of small (e.g., 200 bp) fragments but 
there should be no amplification of larger fragments (i.e., fragments resulting from 

25 amplification between the forward primer and spaced-apart reverse primers). In 
cancer diagnostics, the target fragment may optionally be an oncogene, a tumor 
suppressor, or any other marker associated with cancer. However, it is not 
necessary to use cancer-associated markers in methods of the invention, as such 
methods are based on the general recognition that samples indicative of disease 

30 contain a greater amount of intact nucleic acids and a greater amount of long 
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fragment nucleic acids. Accordingly, any convenient target nucleic acid locus may 
be used in the methods of the invention. 

The amplification reactions described above may be conducted according to 
any suitable or convenient protocol and the fragment size of the resulting 
5 amplification products (if any) may be determined by any suitable or convenient 
means. 

In an alternative embodiment, methods of the invention comprise conducting 
a series of amplification reactions on a contiguous nucleic acid target fragment, each 
application reaction comprising one forward primer and one reverse primer, such 

10 that pairs of forward and reverse primers are spaced at intervals on a contiguous 
fragment suspected to be in the sample. An example of this arrangement is shown 
in Figure 13. Preferably, the spacings between each forward and reverse primer 
pair are equivalent. In a positive screen, the assay described above will result in a 
series of same-size fragments for most if not all of the primer pairs. Such an array of 

15 amplification products evidences a contiguous target sequence indicative of disease 
(see above). A sample from a disease-free patient should produce little or no 
amplification product, but in any case will not produce the contiguous array of 
amplification products expected from a sample containing a relatively intact 
diagnostic target sequence. 

20 Each of the methods described above are based upon the principle that an 

intact nucleic acid, or a segment of an intact nucleic acid, in a sample is diagnostic. 
Thus, variations on the methods described above are contemplated. Such 
variations include the placement of primers, the number of primers used, the target 
sequence, the method for identifying sequences, and others. For example, in the 

25 method depicted in Figure 13, and described above, it is not necessary that the 
numbers of forward and reverse primers be equal. A forward primer may, for 
example, be used to amplify fragments between two reverse primers. Other 
variations in primer pair placement are within the skill in the art, as are details of the 
amplification reactions to be conducted. Finally, as represented in Figures 12 and 

30 13, capture probes may be used in methods of the invention in order to isolate a 
chosen target sequence. 
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The following examples provide further details of methods according to the 
invention. For purposes of exemplification, the following examples provide details of 
the use of the method if the present invention in colon cancer detection. 
Accordingly, while exemplified in the following manner, the invention is not so limited 
5 and the skilled artisan will appreciate its wide range of application upon 
consideration thereof. 

Exemplary Method for the Detection of Colon Cancer 

The following example relates to screening for colon cancer in voided stool 

10 samples. Based upon the principles upon which the invention is based (see above), 
the same analysis can be performed on other samples, such as those mentioned 
above, with the same results as shown herein. 

For the analysis of stool samples, preferred methods of the invention 
comprise obtaining at least a cross-sectional or circumferential portion of a voided 

15 stool -as taught in U.S. patent number 5,741 ,650, and co-pending, co-owned U.S. 
patent application serial number 09/059,718, both of which are incorporated by 
reference herein. While a cross-sectional or circumferential portion of stool is 
desirable, methods provided herein are conducted on random samples obtained 
from voided stool, which include smears or scrapings. Once obtained, the stool 

20 specimen is homogenized. A preferable buffer for homogenization is one that 
contains at least 16 mM ethylenediaminetetraacetic acid (EDTA). However, as 
taught in co-pending, co-owned U.S. patent application serial number 60/122,177, 
incorporated by reference herein, it has been discovered that the use of at least 150 
mM EDTA greatly improves the yield of nucleic acid from stool. Thus, a preferred 

25 buffer for stool homogenization comprises phosphate buffered saline, 20-1 00 mM 
NaCI or KCI, at least 150 mM EDTA, and optionally a detergent (such as SDS) and a 
proteinase (e.g., proteinase K). 

After homogenization, nucleic acid is preferably isolated from the stool 
sample. Isolation or extraction of nucleic acid is not required in all methods of the 

30 invention, as certain detection techniques can be adequately performed in 

homogenized stool without isolation of nucleic acids. In a preferred embodiment, 
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however, homogenized stool is spun to create a supernatant containing nucleic 
acids, proteins, lipids, and other cellular debris. The supernatant is treated with a 
detergent and proteinase to degrade protein, and the nucleic acid is phenol- 
chloroform extracted. The extracted nucleic acids are then precipitated with alcohol. 
5 Other techniques can be used to isolate nucleic acid from the sample. Such 

techniques include hybrid capture, and amplification directly from the homogenized 
stool. Nucleic acids can be purified and/or isolated to the extent required by the 
screening assay to be employed. Total DNA is isolated using techniques known in 
the art. 

10 

Screening Assay Protocol 

The size of human DNA fragments obtained above can be determined by 
numerous means. For example, human DNA can be separated using gel 
electrophoresis. A 3% agarose gel is prepared using techniques known in the art. 

15 See Ausubel et. al., Short Protocols in Molecular Biology, John Wiley & Sones, 
1 195, pgs. 2-23-2-24, incorporated by reference herein. The size of human DNA 
fragments is then determined by comparison to known standards. Fragments 
greater than about 200 bp provide a positive screen. While a diagnosis can be 
made on the basis of the screen alone, patients presenting a positive screen are 

20 preferably advised to seek follow-up testing to render a confirmed diagnosis. 

A preferred means for determining human DNA fragment length uses PCR. 
Methods for implementing PCR are well-known. In the present invention, human 
DNA fragments are amplified using human-specific primers. Amplicon of greater 
than about 200 bp produced by PCR represents a positive screen. Other 

25 amplification reactions and modifications of PCR, such as ligase chain reaction, 
reverse-phase PCR, Q-PCR, and others may be used to produce detectable levels 
of amplicon. Amplicon may be detected by coupling to a reporter (e.g. fluorescence, 
radioisotopes, and the like), by sequencing, by gel electrophoresis, by mass 
spectrometry, or by any other means known in the art, as long as the length, weight, 

30 or other characteristic of the amplicons identifies them by size. 
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Examples 

Experiments were conducted to determine whether characteristics of 
amplifiable DNA in stool were predictive of cancer or precancer in patients from 
whom stools samples were obtained. In the first experiment, the amount of 

5 amplifiable DNA was measured in each of several stool samples using PCR 
amplification to detect DNA fragments in the sample of at least 200 base pairs in 
length. The second experiment determined the amount of long fragments (greater 
than 200 base pair) in the same samples, and then determined ratios of long product 
to short product. The third experiment determined a profile of amplification products 

10 with nucleic acid fragment lengths of 200 bp, 400 bp, 800 bp, 1.3 Kb, 1.8 Kb and 2.4 
Kb. The fourth and fifth experiments were clinical studies correlating the integrity of 
nucleic acids in patient stool samples with overall patient disease status. 

EXAMPLE 1 

15 

Stool samples were collected from 9 patients who presented with symptoms 
or a medical history that indicated that a colonoscopy should be performed. Each 
stool sample was frozen. Immediately after providing a stool sample, each patient 
was given a colonoscopy in order to determine the patient's disease status. Based 

20 upon the colonoscopy results, and subsequent histological analysis of biopsy 

samples taken during colonoscopy, individuals were placed into one of two groups: 
normal or abnormal. The abnormal group consisted of patients with cancer or with 
an adenoma of at least 1 cm in diameter. Based upon these results, 4 of the 9 
patients were placed into the abnormal group. 

25 The samples were screened by hybrid capturing human DNA, and 

determining the amount of amplifiable DNA having at least 200 base pairs. Each 
frozen stool specimen, weighing from 7-33 grams, was thawed and homogenized in 
500 mM Tris, 16 mM EDTA, and 10 mM NaCI, pH 9.0 at a volume, to mass ratio of 
3:1 . Samples were then rehomogenized in the same buffer to a final volume-to- 

30 mass ratio of 20:1 , and spun in glass macro beads at 2356 xg. The supernatant was 
collected and treated with SDS and proteinase k. The DNA was then phenol- 
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chloroform extracted and precipitated with alcohol. The precipitate was suspended 
in 10 mM Tris and 1 mM EDTA (1 x TE), pH 7.4. Finally, the DNA was treated with 
Rnase. 

Human DNA was isolated from the precipitate by sequence-specific hybrid 
5 capture. Biotynilated probes against portions of the p53, K-ras, and ape genes were 
used. 

The K-ras probe was 
S'GTGGAGTATTTGATAGTGTATTAACCTTATGTGTGAC 3' (SEQ ID NO: 1). 

There were two ape probes: apc-1309 was 
10 S'TTCCAGCAGTGTCACAGCACCCTAGAACCAAATCCAG 3' (SEQ ID NO: 2), and 
apc-1378 was 5'CAGATAGCCCTGGACAAACAATGCCACGAAGCAGAAG 3' (SEQ 
ID NO: 3). 

There were four probes against p53, the first (hybridizing to a portion of exon 
5) was 5TACTCCCCTGCCCTCAACAAGATGTTTTGCCAACTGG3' (SEQ ID 
is NO:4), the second (hybridizing to a portion of exon 7) was 

5 , ATTTCTTCCATACTACTACCCATCGACCTCTCATC3 , (SEQ ID NO: 5), the third, 
also hybridizing to a portion of exon 7 was 

5'ATGAGGCCAGTGCGCCTTGGGGAGACCTGTGGCAAGC3 , (SEQ ID NO: 6); 
and finally, a probe against exon 8 had the sequence 

20 5'GAAAGGACAAGGGTGGTTGGGAGTAGATGGAGCCTGG3' (SEQ ID NO: 7). 

A 1 0 \x\ aliquot of each probe (20 pmol/capture) was added to a suspension 
containing 300 jjtl DNA in the presence of 310 jxl 6M GITC buffer for 2 hours at room 
temperature. Hybrid complexes were isolated using streptavid in-coated beads 
(Dynal). After washing, probe-bead complexes were suspended at 25° C for 1 hour 

25 in 0.1 x TE buffer, pH7.4. The suspension was then heated for 4 minutes at 85° C, 
and the beads were removed. 

Captured DNA was then amplified using PCR, essentially as described in 
U.S. Patent No. 4,683,202, incorporated by reference herein. Each sample was 
amplified using forward and reverse primers through 7 loci (Kras, exon 1 , APC exon 

30 15 (3 separate loci), p53, exon 5, p53, exon 7, and p53, exon 8) in duplicate (for a 
total of 14 amplifications for each locus). Seven separate PCRs (40 cycles each) 
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were run in duplicate using primers directed to detect fragments in the sample 
having 200 base pairs or more. Amplified DNA was placed on a 4% Nusieve (FMC 
Biochemical) gel (3% Nusieve, 1% agarose), and stained with ethidium bromide (0.5 
H.g/ml). The resulting amplified DNA was graded based upon the relative intensity of 

5 the stained gels. The results are shown in Figures 1-7. Each Figure represents the 
results for all 9 patients (including standards) for the seven different loci that were 
amplified. As shown in the Figures, each sample from a patient with cancer or 
adenoma was detected as a band having significantly greater intensity than the 
bands associated with samples from patients who did not have cancer or precancer. 

10 All four cancer/adenoma patients identified using colonoscopy were correctly 

identified by determining the amount of amplifiable DNA 200 base pairs or greater in 
length. As shown in Figures 1-7, the results were the same regardless of which 
locus was amplified. Accordingly, the amount of 200 bp or greater DNA in a sample 
was predictive of patient disease status. 

15 

EXAMPLE 2 

An experiment was conducted that was essentially identical to the one 
described above in Example 1 , but forward and reverse primers were placed such 
that fragments of about 1 .8 Kb and above were amplified. 

20 DNA was prepared as described above. Forward and reverse primers were 

spaced so as to hybridize approximately 1 .8 Kb apart on three different loci (Kras, 
exon 1, APC, exon 15, and p53 exon 5). Thirty-three rounds of amplification were 
performed, and the resulting DNA was placed on a 3% agarose gel. The results are 
shown in Figures 8-10. As shown in the Figures (which show results from three 

25 separate experiments to amplify and detect "long" product), samples from individuals 
having cancer or precancer produced large amounts of high-molecular weight (in 
this case 1 .8 Kb and above) DNA; whereas samples from patients who did not have 
cancer or precancer produced no DNA in the range of about 1 .8 Kb and higher. 
Thus, the presence of high-molecular weight DNA was indicative of the disease 

30 status of the patient. 
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EXAMPLE 3 

An experiment was conducted to determine the molecular weight profile of 
DNA from samples collected and prepared as part of a blind study on 30 patients 
who presented at the Mayo Clinic with suspected gastrointestinal disorders. Stool 
5 samples were obtained, and DNA was isolated as described above. 

Prior to amplification, DNA was isolated from the samples by hybrid capture. 
Biotynilated probes against portions of the BRCA1, BRCA2, p53, APC genes were 
used. 

The BRCA1 probe was 
10 S'GATTCTGAAGAACCAACTTTGTCCTTAACTAGCTCTTS' (SEQ ID NO: 8). 

The BRCA2 probe was 
S'CTAAGTTTGAATCCATGCTTTGCTCTTCTTGATTATTS' (SEQ ID NO 9). 

The APC 1 probe was 
5 , CAGATAGCCCTGGACAAACCATGCCACCAAGCAGAAG3 , (SEQ ID NO 10). 
15 The p53 probe, hybridizing to a portion of exon 5, was 

5TACTCCCCTGCCCTCAACAAGATGTTTTGCCAACTGG3 , (SEQ ID NO:4). 

The APC2 probe was 
5 , GAAGTTCCTGGATTTTCTGTTGCTGGATGGTAGTTGC3 , (SEQ ID NO 11). 

20 A 300 \x\ aliquot of sample was placed in 300 (il of 6 M guanidine 

isothiocyanate buffer with 10 jnl of each capture probe, and incubated overnight at 25 
C. Captured DNA was isolated using 100 \x\ capture beads incubated for one hour 
at room temperature. The DNA was eluted off the beads and PCR amplified under 
standard PCR conditions. 

25 According to methods of the invention, amplification reactions were 

conducted using forward and reverse primers through the 5 loci for each sample. 
Forward and reverse primers were spaced to amplify fragments of 200 bp, 400 bp, 
800 bp, 1 .3 Kb, 1 .8 Kb, and 2.4 Kb. Each of 30 PCR reactions was run for 36 
cycles. Amplicon was run on a 3% Seakeam gel, and stained with ethidium 

30 bromide. The results are shown in Figures 1 1 A and 1 1 B. Each figure represents 
the results for 15 of the 30 patients. 
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As shown in those figures, patients with cancer or adenoma have an 
increased yield of ampiifiable DNA. That is especially true at the 1 .8 Kb level and 
above. Thus, patients with cancer or adenoma not only produce more ampiifiable 
DNA in their stool, but also produce larger DNA fragments than are produced in the 
5 stool of patients who do not have cancer. Thus, both an increased yield of 

ampiifiable DNA and the presence of high molecular weight DNA, especially that at 
1 .8 Kb and above, were indicative of patient disease status. 

EXAMPLE 4 

io In this example, methods of the invention were correlated with clinical 

outcome in numerous patients who had a colorectal adenoma or colorectal cancer 
as diagnosed using colonoscopy, and 79 patients who were diagnosed as not 
having colorectal cancer or adenoma. A stool sample was obtained from each of 
these patients and prepared as described above. Fragments of the 5 different loci 

15 referred to above were amplified using primers spaced 200, 400, 800, 1300, 1800, 
and 2400 base pairs apart using the protocol described above in Example 3. Each 
amplification was scored such that successful amplification of a fragment received a 
score of 1 , and no amplification received a score of 0. Since five loci were 
interrogated using 6 primer pairs each, the maximum score was 30 (successful 

20 amplification of all 6 fragments at all five loci). The cutoff for a positive screen was 
set at 21 . The results are shown below. 
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Table 1 
Normals 



Patient No. 


Age 


Score 








P-178 


64 


19 


P-185 


50 


18 


P-033 


56 


16 


P-177 


67 


14 


P-055 


75 


13 


P-029 


70 


12 


P-079 


63 


12 


P-066 


72 


11 


P-027 


65 


10 


P-054 


72 


9 


P-158 


59 


9 


P-043 


56 


8 


P-009 


73 


7 


P-030 


86 


2 


P-032 


51 


1 


P-068 


58 


1 


P-187 


63 


1 


P-018 


68 


0 


P-186 


61 


17 


P-135 


67 


14 


P-120 


75 


13 


P-179 


76 


9 


P-057 


56 


7 


P-143 


65 


6 


P-136 


58 


1 


P-012 


75 


0 



Table 2 
Adenomas 



Patient No. 


Age 


Score 


P-003 




29 


P-001 




23 


P-045 




22 


P-162 




21 


P-163 




16 


P-088 




15 


P-050 




13 


P-060 




11 


P-061 




11 


P1058 




10 


P-075 




10 


P-077 




8 


P-024 




7 


P-056 




7 


P-067 




7 


P-025 




6 


P-080 




4 


P-123 




4 


P-048 




3 


P-040 




2 


P-006 




1 


P-004 




0 


P-015 




0 


P-083 




0 


P-047 






P-129 
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Table 3 
Carcinomas 



Patient No. 


Age 


Score 


P-064 




30 


P-103 




30 


P-104 




30 


P-108 




30 


P-101 




29 


P-102 




29 


P-099 




28 


P-107 




28 


P-110 




26 


P-098 




25 


P-134 




24 


P-062 




23 


P-090 




23 


P-095 




23 


P-093 




22 


P-100 




21 


P-122 




18 


P-084 




15 


P-109 




15 


P-118 




10 


P-138 




10 


P-091 




8 


P-096 




8 


P-053 




7 


P-119 




6 


P-117 




5 


P-105 




0 


P-097 







As shown above, methods of the invention are effective in screening for the 
presence of colorectal cancer and adenoma. 
EXAMPLE 5 

In this example, methods of the invention were used to detect non-colonic 
cancers in 28 patients. 

A stool sample was obtained from each of the 28 patients. The sample was 
prepared as described above. Fragments of the 5 different loci referred to above 
were amplified using primers spaced 200, 400, 800, 1300, 1800, and 2400 base 
pairs apart using the protocol described above in Example 3. Each amplification 
was scored such that successful amplification of a fragment received a score of 1 , 
and no amplification received a score of 0. Since five loci were interrogated using 6 
primer pairs each, the maximum obtainable score was 30 (successful amplification 
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of all 6 fragments at all five loci). A score of 21 was used as a cutoff between 
diseased and non-diseased patients. The results are shown below. 



Table 4 

5 Supercolonic Cancers 





Patient No. 


Supercolonic 
Cancer 


Age 


Score 




P-145 


Pancreas 


68 


30 




P-164 


Lung CA 


68 


30 




P-166 


Bile Duct 


52 


30 




P-189 


Bile Duct 


43 


30 




P-190 


Lung CA 


50 


30 




P-019 


Atypical 
Findings in 
Stomach 


71 


29 




P-152 


Lung CA 


77 


28 




P-167 


Pancreas 


72 


28 




P-011 


Lung CA 


73 


27 




P-153 


Pancreas 


65 


27 




P-165 


Lung CA 


85 


27 


5£af 


P-170 


Duodenum 


65 
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As shown above, methods of the invention successfully screened 18 out of 27 
patients who actually had non-colonic cancer. Only one patient was misdiagnosed 
10 as having cancer when he did not. Thus, the methods of the invention are useful for 
non-invasive diagnosis of a non-specified cancerous disease state in a patient. 

The threshold of 21 for a positive screen can be changed to accommodate 
desired sensitivities and specificities. For example, if 1 8 were the false negative 
results shown in Table 4 would be avoided. The skilled artisan knows how to set 
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thresholds depending on the patient (e.g., a lower threshold for patients with 
symptoms than patients presenting with no symptoms), the disease being 
diagnosed, and the desired level of sensitivity and specificity. Regardless of the 
threshold, the principle of the invention remains that nucleic acid integrity is a viable 

5 marker for disease, and especially for cancer. 

In addition, the propensity for disease may be measured using methods of the 
invention. For example, periodic molecular weight profiling in accordance with the 
methods of the invention may be used to monitor the disease state of a patient 
presenting no or minimal symptoms. Such longitudinal monitoring will determine 

10 whether a patient is progressing with increasing amounts of high integrity nucleic 
acids - indicating the desirability for follow-up examination. 
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What is claimed is: 

1 . A method for screening for a supercolonic disease in a patient, the method 
comprising the steps of: 
5 determining the integrity of nucleic acids in a patient sample comprising shed 

cells or cellular debris; and 

identifying said patient as having disease if intact nucleic acids are present in 
said sample in an amount greater than a predetermined threshold. 

10 2. The method of claim 1 , wherein said disease is cancer or pre-cancer. 

3. The method of claim 2, wherein said cancer is selected from the group 
consisting of lung cancer, esophageal cancer, prostate cancer, stomach cancer, 
pancreatic cancer, liver cancer, cancer of the bile duct, cancer of the duodenum, and 
15 lymphoma. 
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Abstract 

The present invention provides methods for detecting disease by analysis of a 
patient sample to determine the integrity of nucleic acids in the sample. 
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